Skip to main content

Job Opportunities at the Internet Archive

    Manager: Operations and Infrastructure

    Partner Associate, Archive-It

    Senior Application Developer: Archive.org

    Senior Engineer: Wayback Machine

    Storage Manager for the Physical Archive

    Web Archiving Software Engineer

    About the Internet Archive

    The Internet Archive is a non-profit with a huge mission: to give everyone access to all knowledge the books, web pages, audio, television and software of our shared human culture. Forever. Based in San Francisco and with satellites around the world, the Internet Archive staffers are building the digital library of the future--a place where anyone can go to learn and explore. Our 160 engineers, book scanners, librarians, designers and team members have built the 250+ most popular website in the world. (https://archive.org) Internet Archive is a non-profit digital library offering free universal access to books, movies & music, as well as 464 billion archived web pages.

    Manager: Operations and Infrastructure

    Location: Inner Richmond, San Francisco, CA and City of Richmond, CA ON-SITE PRESENCE IN SF/RICHMOND IS REQUIRED! Remote employment not available for this position.

    Job Classification: Full-time, exempt

    Job Summary: The Internet Archive has over 25PB of unique digital information all running across an integrated cluster of over 700 VMs on 500+ bare-metal hosts in 3 data centers. We are looking for a smart operations manager and engineer with proven experience effectively managing a high-performance team of system administrators and technical operations staff. The ideal candidate will have solid demonstrated experience improving and maintaining the reliability and performance of both internal and publicly facing web infrastructure, online services, information repositories, and database systems. They must also be skilled in management communications and able to work collaboratively with our teams of talented engineers and program staff.

    Essential Job Functions:
    • Manage and mentor the technical and devop team responsible for monitoring, maintaining, and restoring the health of all Internet Archive online services. This includes all publicly-facing service, the storage and compute cluster, as well as key internal services related to crawling, indexing, and access to archived web content
    • Maintain and expand monitoring and reporting systems to communicate current and historical activity for multiple publicly facing Services and to ensure service continuity and performance.
    • Analyze, implement, and manage effective improvements in the maintenance and operations processes and infrastructure.
    • Assignment, recruiting, hiring, and terminating of staff as needed to sustain operational objectives and efficiency.
    • Recommend the purchase of equipment needed to sustain responsive services and cost-effective operations.

    Minimum Qualifications:

    • Experience managing large server cluster infrastructure
    • Experience as lead manager and mentor of a technical operations team
    • Passion and fierce advocate for the end user experience of web-delivered services
    • Experience in highly available 24x7 production environment.
    • Passion for automation, data-driven decision making, and information reporting
    • Experience with high-bandwidth networking environments
    • Work history that includes production-level programming in high-transaction environments.
    • Fluency in Linux system administration, Unix shell scripting, and Python, PHP, etc.
    • Extensive experience with Ansible, Git, Nagios, Postgres, Redis, ELK stack, etc.
    • Experience deploying and administering database, search, and web-host services
    • Be an excellent and creative problem solver. You do not need to know everything but you need to know how to find the solution.
    • Experienced in open source practices and passion for staying current with industry trends
    • Excellent oral/written communication and documentation skills
    • Willingness to travel to network operation centers and participate as necessary in physical equipment install
    • BS Computer Science, or equivalent work experience

    Preferred Qualifications:
    • Experience deploying and maintaining big-data analytics tools, especially Hadoop, Druid, or RethinkDB
    • MS in Computer Science or equivalent work experience
    • Flexibility and a sense of humor

    Reporting Structure: The Manager: Operations and Infrastructure reports to the Director of Engineering and works closely with the Head Librarian and Founder.

    To Apply: Please send your resume and cover letter to jobs+manageroperationsengineering@archive.org with the subject line "CI-402: Manager of Operations and Infrastructure."

    Internet Archive reserves the right to revise job descriptions or work hours as required.

    Internet Archive is an Equal Opportunity Employer and a 501(c)(3) non profit library founded in 1996.

    The Archive will consider for employment-qualified applicants with criminal histories in a manner consistent with the requirements of the Fair Chance Ordinance.

    AE-308: Senior Application Developer: Archive.org

    Location: San Francisco, CA

    Job Classification: Full-time, exempt

    Job Summary: The Internet Archive has a huge corpus of digital information. Every day, our team of development engineers creates tools and applications that help our users to access and work with 22 petabytes of content that includes millions of books and texts, millions of hours of video, millions of audio tracks, and over 450 billion web captures. We are looking for smart engineers to help develop next generation of web-based applications and tools that will be used by libraries and archives around the world to build and manage curated collections of books, texts, web, and image content. The ideal candidate will be a strong programmer who has successfully led and completed several projects involving large or intricate web applications or services, and who works collaboratively with talented engineering colleagues.

    Key Responsibilities:
    • The responsibilities of this position are to be part of the team that will maintain and evolve the Archive.org web site. More specifically, this means:
    • Work at the direction of the technical project lead to continue to evolve and enhance the next generation of the archive.org web site.

    Minimum Qualifications:

    • Passion for delivering delightful end-user experiences when interacting with delivered web applications and services.
    • Extensive work experience with Javascript, HTML5, and CSS.
    • Extensive experience developing applications and websites in PHP
    • Work history that includes integrating front end user interfaces with search, database , and business logic to create integrated applications and services.
    • Experience working with digital media files and metadata structures
    • Experience developing and maintaining structured APIs
    • Good understanding of latest web framework technologies and protocols
    • Fluency in Linux environments
    • Flexibility and a sense of humor

    Preferred Qualifications:
    • Strong programming experience Python.
    • Experience open source practices and participation in open source forums
    • Experience working with time-based digital media (audio and video).
    • Specific experience with Atlassian tool sets (Jira, Confluence)

    Reporting Structure:The Web Application Developer reports to the Director of Engineering and will work closely with the web archiving and TV archiving teams. The entire staff is guided by founder and Digital Librarian, Brewster Kahle.

    To Apply:Please send your resume and cover letter to Jobs+Seniorapplicationdeveloper@archive.org with the subject line "AE-106: Web Application Developer."

    Internet Archive reserves the right to revise job descriptions or work hours as required.

    Internet Archive is an Equal Opportunity Employer and a 501(c)(3) non profit library founded in 1996.

    The Archive will consider for employment-qualified applicants with criminal histories in a manner consistent with the requirements of the Fair Chance Ordinance.

    Senior Engineer: Wayback Machine

    Location: San Francisco, CA

    Job Classification: Full-time, exempt

    Job Summary:The Internet Archive's Wayback Machine is the world's largest public archive of historical web sites. Have you ever wanted to work with 450 billion things at once? Would you like to serve 1,500 requests per second? How about having your service referred to regularly in news articles and blog posts across the web? You can work on a challenging and popular project and help the world at the same time.

    We are looking for a smart, collaborative and resourceful engineer to help develop the next version of the Wayback Machine. The ideal candidate will possess a desire to work collaboratively with a small internal team and a large, vocal and active user community; demonstrating independence, creativity, initiative and technological savvy, in addition to being a great programmer/architect.

    Minimum Qualifications:

    • 2-3 years work experience in Python, or similar
    • Experience working in Linux environments
    • Familiarity with Java (current deployment is written in Java)
    • Good understanding of latest web framework technologies and aspects of web technology and protocols
    • Flexibility and a sense of humor
    • BS Computer Science, or equivalent work experience

    Preferred Qualifications:

    • Experience with web crawlers and/or applications designed to display archived web content (especially server-side apps)
    • Cluster computing experience
    • Open source practices experience

    To Apply: Please send your resume and cover letter to Jobs+SeniorWaybackEngineer@archive.org with the subject line "Wayback Machine Senior Engineer."

    Internet Archive reserves the right to revise job descriptions or work hours as required.

    Internet Archive is an Equal Opportunity Employer and a 501(c)(3) non profit library founded in 1996.

    The Archive will consider for employment-qualified applicants with criminal histories in a manner consistent with the requirements of the Fair Chance Ordinance.

    Partner Associate, Archive-It

    Job Classification: Full-time, exempt

    Location: San Francisco, CA

    Reports To: Director, Web Archiving Programs

    Job Summary: This position serves as the primary contact for the on-boarding, finance, account maintenance, and administrative needs of Archive-It, which is Internet Archives fast-growing subscription web archiving service. Archive-It is used by over 350+ institutions to build archives of web materials for long-term preservation and access. This position also supports other invoicing, reporting, and administrative needs for the overall Web Archiving Programs team. The Partner Associate will work with the Director and program staff to address and manage new Archive-It partner on-boarding, contracts and invoicing, general finances and partner support, and maintain the groups CRM and other internal systems. The role is a crucial point of contact with partners to facilitate the process of joining Archive-It, to help maintain the service's high renewal rate, to work with team-members to resolve partner account issues, and to participate in other financial, reporting, relationship and project management projects and tasks.

    Essential Job Responsibilities:
    • Works with the Director to assist new Archive-It institutional partners through all stages of the on-boarding process, including service agreements and other account information.
    • Manages other aspects of the on-boarding process, including trainings, account creation, and handling associated partner communications and needs
    • Prepares, sends out, and tracks all invoices related to partner billing.
    • Handles subscription renewal workflow and works with the Finance department on resolving invoicing, contract, and payment issues.
    • Works extensively the SalesForce CRM system to maintain subscription, contact, and other key partner information and maintains the filing system for contracts, grant materials, and other budget tracking documentation.
    • Works on special projects and other duties as assigned by the Director.

    Minimum Qualifications:

    • Two years experience in partner services and support, financial or customer relations, especially in client on-boarding and billing.
    • Strong attention to detail, excellent organizational skills and the ability to juggle the needs of several partners at any time.
    • Excellent written and oral communication skills are a must, as the role has significant interaction with new and existing partners and users.
    • Experience with SalesForce is preferred.
    • Degree in accounting helpful, but not required.
    • Must be able to manage multiple projects, deadlines, and responsibilities.
    • The Partner Associate will view, have access to, and work with confidential data. They must adhere to strict confidentiality policies and procedures.
    • Ability to work in and enjoy a team work environment that includes staff working remotely.

    To Apply: Please forward your cover letter and resume, with "Partner Associate" in the subject line, to jobs+partnerassociate-archiveit@archive.org

    Internet Archive reserves the right to revise job descriptions or work hours as required.

    Internet Archive is an Equal Opportunity Employer and a 501(c)(3) non profit library founded in 1996.

    The Archive will consider for employment-qualified applicants with criminal histories in a manner consistent with the requirements of the Fair Chance Ordinance.

    Storage Manager for the Physical Archive

    Location: IA Warehouses, Richmond, CA

    Job Classification: Full-time, exempt

    Job Summary: As caretaker and advocate for the Physical Archive, the Storage Manager leads the Archive's visionary efforts to preserve one copy of every published book known to humankind for perpetuity, as well as providing a home for special collections of donated texts, music, and film. The Storage Manager oversees daily warehouse staff and workflow to ensure the continuous growth and improvement of the Physical Archive, and manages the storage and digital inventory of its contents. The Storage Manager stewards all onsite public and scholarly access to the physical collection, and educates various communities as needed to support the mission and objectives of the Archive.

    Essential Job Functions:

    • Directly responsible for the day-to-day running of the warehouse, including schedule management, workflow requirements, data input, and problem solving.
    • Oversees all shipment logistics to ensure effective inventory control and to ensure processes are completed in coordination with all partners and digitization centers worldwide.
    • Trains, supervises and provides guidance and leadership to the warehouse team to ensure that all staff perform their duties safely, effectively, and efficiently.
    • Provides timely, accurate, and professional service to all parties visiting and/or accessing the Physical Archive.
    • Ensures all work areas and warehouse equipment are well-maintained, clean and safe.
    • Maintains accurate records and adheres to all facility security, maintenance, and Health & Safety requirements.

    Minimum Requirements:

    • 5 years of hands-on experience managing staff and operations in a warehouse or offsite repository environment.
    • Track record of success in streamlining processes and implementing quality improvements in warehouse operations and inventory management.
    • Some background with developing and maintaining warehouse conditions involving temperature-controlled vaults and other specialist storage and conservation technologies.
    • Familiarity with developing and maintaining digital inventories of physical collections, including computer/technical skills with some capacity to develop and use tools such as backend scripts, web interfaces and APIs.
    • Some background and interest in library science and/or museum or archive collections development is ideal.
    • Excellent communications skills.
    • Able to perform in all tasks performed by floor staff, including some experience or interest in using forklift and pallet jack.
    • Willing and physically able to frequently stand, walk, kneel, bend or climb with or without reasonable assistance; willing and able to lift, push and pull up to 50lbs.
    • Commitment to the greater good and to doing work that has a beneficial long-term impact on society.
    • Flexibility and a sense of humor are essential.

    To Apply: Please send your resume and cover letter via email to jobs+storagemgr@archive.org with "Storage Manager" in the subject line.

    Internet Archive reserves the right to revise job descriptions or work hours as required.

    Internet Archive is an Equal Opportunity Employer and a 501(c)(3) nonprofit library founded in 1996.

    The Archive will consider for employment-qualified applicants with criminal histories in a manner consistent with the requirements of the Fair Chance Ordinance.

    Web Archiving Software Engineer

    Location: Inner Richmond, San Francisco, CA or Remote

    Job Classification: Full-time, Exempt

    Job Summary: The Internet Archive has over 24PB of unique digital information, all running across an integrated cluster of over 700 VMs on 500+ bare-metal hosts in 3 data centers. We are looking for a smart engineer with experience in defining and building service APIs. The ideal candidate will also have experience creating software that interacts with systems at high transaction rates while delivering reliability and performance of both internal and public-facing web applications. All candidates must be able to work collaboratively within our Web Archiving team of talented engineers and program staff.

    Essential Job Functions:

    • Build, test, and package APIs for the transfer of data out of a repository of web archive files
    • Consume external APIs to enable the ingest of external data into web archive files
    • Deploy, administer, and tune tools that support the software development infrastructure and data management and processing environments used within the Web Archiving group
    • Analyze, manage, transfer, and maintain large amounts of archival data in multiple environments
    • Participate in monitoring, maintaining, and restoring the health of the storage and computer cluster and key processes and services related to crawling, indexing, and access to archived web content

    Minimum Qualifications:

    • Fluency in Linux environments, scripting and/or programming skills, development of custom tool integrations
    • Proven experience in Unix shell scripting and Python required
    • Demonstrated experience building or working with APIs
    • Experience deploying and administering database, search, and web-host services
    • Proven experience open source practices, participation in open source forums, and staying current with industry trends
    • BS in Computer Science, or equivalent work experience

    Preferred Qualifications:

    • Familiarity configuration of software development environments and cluster administration tools, including Git, ELK stack and monitoring tools: Nagios, Graphite, Grafana, etc
    • Knowledge of evolving database or analytics tools, especially Hadoop, Druid, or RethinkDB
    • Experience or familiarity with Java is a plus
    • Experience with Atlassian tool sets
    • MS in Computer Science or equivalent work experience
    • Flexibility and a sense of humor

    Reporting Structure: The Web Archiving Software Engineer reports to the Director of Engineering and works closely with the Director, Web Archiving Programs. The position will also work alongside other systems, applications, and QA engineers as well as program staff in Web Archiving Programs team.

    To Apply: Please send your resume and cover letter to jobs+webarchivingengineer@archive.org with the subject line "Web Archiving Software Engineer."

    Internet Archive reserves the right to revise job descriptions or work hours as required.

    Internet Archive is an Equal Opportunity Employer and a 501(c)(3) non profit library founded in 1996.

    The Archive will consider for employment-qualified applicants with criminal histories in a manner consistent with the requirements of the Fair Chance Ordinance.